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■ ABSTRACT 

a^ ■ 
o : 

CO . Spatially resolved spectroscopic data from the CIRPASS integral field unit (IFU) on Gemini 

\ are used to measure the gravitational lensing of the 4-image quasar Q2237+0305 on different 

i-S^ ' size scales. A method for measuring the substructure present in the lens using observations 

at multiple wavelengths is demonstrated to be very effective and independent of many of the 
O ' degeneracies inherent in previous methods. The magnification ratios of the QSO's narrow line 

' region (NLR) and broad line region (BLR) are measured and found to be disagree with each 

^ , other and with the published radio and mid-infrared magnification ratios. The disagreement 

^ ' between the BLR ratios and the radio/mid- infrared ratios is interpreted as microlensing by 

stars in the lens galaxy of the BLR. This implies that the mid-infrared emission region is larger 
than the BLR and the BLR is 0.1 pc in size. We find a small difference in the shape of 
' the H/3 line in image A when compared to the other images. We consider this difference too 

small and symmetric to be considered strong evidence for rotation or large scale infall in the 
H/3 emission region. The disagreement between the radio/mid-infrared ratios and the NLR 
ratios is interpreted as a signature of substructure on a larger scale, possibly the missing small 
scale structure predicted by the standard cold dark matter (CDM) model. Extensive lensing 
simulations are performed to obtain a lower limit on the amount of substructure that is required 
to cause this discrepancy as a function of its mass and the radial profile of the host lens. The 
substructure surface density is degenerate with the radial profile of the host lens, but if the 
expectations of the CDM model are taken into account certain radial profiles and substructure 
surface densities can be ruled out. A substructure mass scale as large as 10^ Mq is strongly 
disfavored while 10'' Mq is too small if the radio and mid-infrared emission regions have the 
expected sizes of ^ 10 pc. The standard elliptical isothermal lens mass profile is not compatible 
with a substructure surface density of Sgub < 280 Mq pc~^ at the 95% confidence level. This is 
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4 — 7% of the galaxy's surface density (depending on which image position is used to evaluate 
this) . The required substructure surface density at the required mass scale is high in comparison 
with the present expectations within the CDM model. Lens mass profiles that are flatter than 
isothermal - where the surface density in dark matter is higher at the image positions - are 
compatible with smaller quantities of substructure. 

1. Introduction 

In this paper we present observations of the Icnscd quasi-stellar object (QSO) Q2237-I-0305 
and use them to address fundamental questions concerning the Cold Dark Matter (CDM) 
cosmological model and in addition to investigate the internal structure of the QSO. The CDM 
cosmological model has recently experienced great successes in predicting the distribution of 
galaxies and the fluctuations in the cosmic microwave background. However, there remain some 
challenges to this model on the scale of galaxies themselves. Simulations of structure formation 
within this model predict that a high level of substructure will survive tidal disruption within 
the large dark matter (DM) halos that surround galaxies. N-body simulations indicate that 
there should be plentiful subhalos in a galaxy's halo, with masses above 10'' Mq - smaller 
mass scales are presently inaccessible due to limitations in resolution. If these subhalos contained 
stars, there would be significantly more dwarf galaxies in the Local Group than are seen (Moore 
et al. 1999; Klypin ct al. 2001). 

Various solutions to this "dwarf satellite problem" have been proposed. In particular, if 
the dark matter is warm (WDM) rather than cold then the small scale structure is reduced 
while the large scale (successful) properties of the standard paradigm are preserved. Another 
explanation for the absence of observable substructure is that the star formation in these small 
DM halos was inhibited by ionizing radiation at an early stage in the universe (e.g. Somerville 
(2002); Bullock, Kravtsov, & Weinberg (2000); Dong, Murray, & Lin (2003)). In this case the 
substructure does exist, but is invisible. Metcalf & Madau (2001) showed that this substructure 
could be detected in gravitational lenses. There is already some evidence in support of the 
need for substructure to explain the relative magnifications in strong lenses; disentangling its 
signature from other (also interesting) effects is the goal of this paper. 

The extraction of useful information from QSO lens systems has long been hampered by 
uncertainties and ambiguities related to modeling the lens' mass distribution. This has been 
the case for estimates of the Hubble parameter from lensing and it has necessitated long term 
monitoring in the optical to detect microlcnsing. Recently it has become an important issue 
in testing the CDM cosmological model by detecting the dark substructure predicted in this 
model. From the positions of the images and lens galaxy of a 4-image lens system, a parametric 
smooth mass model for the lensing galaxy and halo can be computed (including external shear), 
along with predictions for the three independent flux ratios. Measuring deviations between the 
predictions and the (diffcrcntial-rcddening-correctcd) observed fluxes in real lenses has been used 
to argue that substructure is present in almost all QSO lens systems (Metcalf & Madau 2001; 
Chiba 2002; Metcalf & Zhao 2002; Dalai & Kochanek 2002). Some substructure has long been 
considered a possible explanation for inconsistencies in the observed flux ratio of B1422+231. 
Mao & Schneider (1998) argued that in this case the source quasar is near a cusp caustic in the 
lensing map and because of this the sum of the magnifications of the three closest images should 
be close to zero for any smooth lens model. B1422+231 and all other known cusp caustic lenses 
violate this relation. It is unclear how far this argument can be taken since the proximity of the 
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source to the caustic depends on the lens model itself (see Gaudi & Fetters 2002) and stellar 
disks may cause the relation to be violated even very close to the cusp (Bradac et al. 2003). 
It is also unclear what mass or size the substructure needs to be for this argument to break 
down, i.e. how to measure properties of the substructure from violations of these relations. In 
any case, most lens systems do not have such advantageously located source quasars and the 
predicted flux ratios can be strongly dependent on the parametric lens models used while the 
image positions are degenerate. Any estimate of the mass fraction in substructure that uses a 
simple lens model e.g. Dalai & Kochanek (2002) is highly suspect. The observational technique 
used here largely avoids this important problem, as well as the problems of intrinsic variability 
(on the timescale of the typical time-delay between images) , differential extinction, scattering in 
the radio and microlensing by stars in the lens galaxy. 

The 4- image gravitational lens Q2237-I-0305, also known as Huchra's lens after its discoverer 
(Huchra ct al. 1985), has been a testbed for studying microlensing for many years. The QSO 
at redshift z = 1.69 is lensed by a single spiral galaxy at z = 0.03 which makes it especially 
advantageous for this. The microlensing of a QSO was first observed by Irwin et al. (1989) in 
this system and since then it has been extensively monitored in the optical (Wozniak et al. 2000). 
Q2237-I-0305 has also been instrumental in studying the structure of the QSO itself. Constraints 
on both the size of the optical emission region (Yonehara 2001; Wyithe et al. 2000b) and 
the mid-infrared emission region (Wyithe, Agol, & Fluke 2002) have been established through 
microlensing. In this paper critical measurements of the H/? broad line and [O iii] narrow line 
ratios are added to the existing information on this lens which allows us to draw conclusions 
about the size of the broad line emission region (BLR) and the size of structures in the lens. 

In what follows, if it is not otherwise explicitly stated, the cosmological model is taken 
to be a flat one with a cosmological constant of = 0.7 and a Hubble parameter of 
Ho = 70 km s~^ Mpc~^. The lensing results are often expressed in term of the critical surface 
density defined as 'I^~^^^{zs,zi) = AirGDiDis/Ds where Dg is the angular size distance to the 
source, Di is the distance to the lens and Dis the distance between the lens and the source. For 
the case of Q2237+0305 Scrit = 9.8 x 10'^ Mq pc~^ in the above cosmology. The surface density 
in units of the critical density is denoted k. 

In the next Section the general method of spectroscopic gravitational lensing and its special 
advantages are outlined. In Section 3 the data reduction is described. The mcasiircmcnts of line 
strengths and magnification ratios are described in Section 4 along with their comparison with 
previously published measurements in other wavelengths. These measurements are interpreted 
in Section 6 in terms of the structures that could be responsible for them. A summary and 
discussion are given in Section 7. 

2. The Method of Spectroscopic Compound Lensing 

To remove much of the lens model ambiguities, a different method for detecting substructure 
lensing was proposed by Moustakas & Metcalf (2003) which avoids the problems arising from 
degeneracies in lens models, differential reddening in the optical images and scattering in the 
radio. This method utilizes the prediction that different emission regions of the QSO differ 
in size and thus should be magnified to differing degrees if substructure on a small scale is 
present. Comparing the magnification ratios in different wavelengths provides information on 
the structure of the QSO and the small scale structure of the lens that is independent of the 
large scale structure of the lens galaxy and halo. To describe this method we will first briefly 
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review what is known about the anatomy of a QSO. 

2.1. sketch of QSO anatomy 

The variability timescales of different spectroscopic features, and the time delays between 
them, have been used to establish the size and structure of different emission regions within 
the unified model for active galactic nuclei (AGN) and QSOs (c.f. Antonucci 1993). The basic 
picture consists of a supermassive black hole {M.f,h > IO^Mq) in the core, surrounded by an 
accretion disk, which produces the nearly flat-spectrum continuum light. The continuum flux 
varies on very short timescales, less than a day, and so must be very compact (around 100 AU 
or ~ 5 X 10~^pc). It is also known to be microlensed in the case of Q2237+0305 which puts 
an upper limit of 2,000 AU on its size (Yonehara 2001; Wyithe, Webster, & Turner 2000a; 
Wambsganss, Schneider, & Paczynski 1990). Beyond that, there are permitted lines such as the 
Balmer-series lines that are very broad (ufwhm > lO'^kms^^) due to gravitationally-induced 
motions. The characteristic size of this broad line region (BLR) apparently scales with the 
intrinsic luminosity of the host QSO (and therefore with the mass of the central black hole; 
Kaspi et al. 2000; Wandel 1999), such that in luminous QSOs (as opposed to, e.g., Seyfert I's), 
the size of the BLR is on the order of six light- months (or '~ 0.1 pc). According to long-term 
studies of QSOs, the typical (rest-frame) flux variations in the QSO continuum are on the order 
of 10-70% (though occasional fluctuations as high as ^ 50 x are possible; Ulrich, Maraschi, & 
Urry 1997), while the BLR variations are smaller by a factor of 2-4 (Maoz et al. 1994). 

The narrow emission lines (e.g. [O iii] AA4959, 5007) do not vary in flux signiflcantly over the 
time span of several years (Kaspi et al. 1996). For this reason, they have been used for calibration 
in long-term spectroscopic monitoring observations of low redshift QSOs for reverberation 
mapping experiments, by which the time delays between features of different ionization states 
may in principle be used to map all the components that make up QSOs (Peterson 1993). 
Photo-ionization arguments and cases where the narrow line region (NLR) is imaged directly, 
indicate that the size of the NLR extends out to 100-lOOOpc and is approximately proportional 
to the square root of the luminosity in narrow lines. 

A QSOs thermal emission in the mid-infrared is believed to come from a dusty torus that 
extents from a few parsecs to tens of parsccs. This belief largely comes from modeling the 
radiation transfer (for example Andreani, Franceschini, & Granato 1999) although recently a 
more direct lower limit on the size of this emission region has been made by comparing the 
microlensing of the optical continuum emission and the mid-IR (Wyithe et al. 2002). 

A quasar can emit in the radio on a large range of size scales. The central region of a radio 
loud QSO is usually resolved by VLBI (very long baseline interferometry) into a core and jet, 
or a core and lobes on a scale of 10 pc. The size of the core is limited from below by the 
Compton surface brightness limit which is generally in the 10 /xac range. 

2.2. the differential magnification ratio (DMR) 

Simulations show that if a few percent of the lens surface density is contained in substructures 
of mass ~ 10'' Mq then deviations of ~ 0.1 mag between the magnifications of the NLR and the 
BLR wiU be commonplace (Moustakas & Metcalf 2003). In addition, for the case of Q2237-I-0305 
the microlensing scale or Einstein radius for ordinary stars is particularly large so if the BLR is 
small it could be microlensed. Our goal here has been to measure the NLR and BLR ratios in 
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lens Q2237+0305 and determine if substructure is evident. 

To quantify our results we define the differential magnification ratio (DMR) 

where superscripts denote the image and subscripts the QSO emission region. The magnification 
is II and the observed flux is /. Note that the DMR is independent of the intrinsic QSO 
luminosity, the intrinsic NLR/BLR ratio and if the NL and the BL used are close enough in 
wavelength it will be essentially independent of extinction. In addition, we will define the 
quantity A^l/bl = -2.b\og(jiBLBj I^nlr) so that A^^^^^ - Af^^^g^ is a measure of the 
DMR in magnitudes. If there is no substructure all three independent DMRs in a 4-image 
lens should be close to one (a small deviation from one is possible because of the host galaxy's 
influence on the NLR; see Section 6.2). More generally the DMR can be used to compare any 
two measurements of the magnification ratios to see how well they agree. 



3. Data Reduction 

The J-band (Ac ~ 1.25 /um) spectroscopy of Q2237+0305 was performed with the Cambridge 
Infrared Panoramic Survey Spectrograph CIRPASS Parry et al. (2000) at the f/16 focus on the 
8-m Gemini-South telescope, at Cerro Pachon in Chile. CIRPASS is a near-infrared fiber-fed 
spectrograph, connected to a 490 clement integral field unit (IFU). The variable lenslet scale 
was set to 0'.'36 diameter, and the hexagonal lenslets are arranged in the IFU to survey an 
approximately rectangular area of 13'.'0 x 4'.'7. The detectors is a Ikxlk Hawaii-I HgCdTe 
Rockwell array. CIRPASS can operate in the range 0.9 — 1.8 /um, and a 4001/mm grating was 
used which produced a dispersion of 2.25 A/pix. The grating was tilted to place the wavelength 
range AA11300 — 13600 A on the detector, covering most of the J-band transmission window 
out to the atmospheric absorption between the J- and iJ-bands. A filter at 1.67 /um blocked 
out redder wavelengths to reduce the dominant near-infrared background contribution. The 
detector pixels do not quite critically sample the spectral resolution (unresolved sky lines have 
FWHM=1.7 pix), and the resolving power is A / AAfwhm = 3500. 

The observations were made on the night of UT 2002 August 10 during the Director's 
discretionary time instrument commissioning/IFU-demonstration science on Gemini-South. 
The observations spanned the airmass range 1.2 — 1.8. A total of 3 hours integration was 
obtained, split into individual integrations of 30 min per pointing, although the array was read 
non-destructivcly every 10 min (i.e., 4 times per pointing, including an initial read of the bias 
level). Each of these 'loops' comprised 10 multiple reads of the detector, which were averaged to 
reduce the readout noise (30e- per read) by VTO. These averages of each loop were subtracted 
from the average of the next loop of non-destructive reads, to form 3 sub-integrations of 10 min 
at the same telescope pointing. Comparison of these sub-integrations enabled cosmic ray 
strikes to be flagged, and a combined frame of 30 min was produced by summing the three 
sub-integrations, while masking pixels affected by cosmic rays. The seeing was « I'.'O FWHM, 
and the long axis of the IFU was set to 130 deg East of North. Windshake affected the guiding, 
and 2 loops were lost through severe trailing of the images. These were removed from the 
subsequent data reduction, so the final science integration was 160 min. In total, six overlapping 
pointings were surveyed, each covering all of the Einstein cross: positions were not repeated to 
improve background subtraction and reduce the impact of bad pixels. We 'nodded' the target 
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from the top (A) to the bottom (B) of the IFU every 1800 s, and introduced a small sub-dither 
of 2 lenslets between integrations. An initial subtraction of sky and dark current was performed 
using this beam-switching: for each exposure at location A, the average of the three spectra 
taken at B was subtracted, and vice versa. There was no overlap of the extended emission from 
the Einstein cross between positions A and B (a chop of Ri 8"). Known bad pixels were also 
interpolated over at this stage. 

The 490 fibers span the Ik detector, with 2 pixels per fiber. A lamp was used to illuminate 
10 calibration fibers immediately before the observations, in order to accurately determined 
the position of the fibers on the array and to focus the spectrograph at the desired wavelength 
range. There is significant cross-talk between adjacent fibers and we use an optimal-extraction 
routine^ (Johnson et al. 2003) to determine the spectrum of each fiber, for each pixel on the 
array solving the contribution of flux from adjacent fibers (knowing their relative positions). 

Immediately after the science integrations, spectral flat flelds were obtained (exposures of the 
illuminated dome), and these were also optimally-extracted and their average flux normalized 
to unity. The extracted science data was then flat-fielded through division by the extracted 
normalized dome lamp spectra, calibrating the response of each individual fiber. 

Wavelength calibration was achieved with an Argon arc lamp observed following the science 
exposures. Again, the individual fiber spectra were optimally extracted, and for each fiber a 
cubic fit was performed to the centroids of 40 arc lines, leaving rms residuals of 0.2 A. The 490 
fiber spectra stacked on the slit were rectified (mapped to the same wavelength) with a 5th order 
polynomial transformation in x and y. This transformation was applied to the beam-switched, 
extracted, flat-fielded science frames. A higher-order background subtraction was then applied 
to this rectified data, to remove sky residuals from the beam switching caused by variation in 
the OH-line intensity. 

The rectified, background-subtracted 2D spectra (with the dimension of wavelength and 
fiber number along the slit) were then converted to 3D data cubes, arranging the fibers into 
their physical positions in 2D on the sky, with wavelength being the third dimension. Because 
of the hexagonal close packing arrangement of the lenslets, alternate rows are offset by half a 
lens: in making the cubes this was accounted for by regridding each lenslet by two in the spatial 
dimensions (i.e. each lenslet is 2x2 sub-pixels of diameter O'.'IS). These six data cubes (from the 
six pointings of the telescope) were registered, shifting them spatially to the same location on 
the sky, and combined with a percentile clipping algorithm, rejecting bad pixels and residuals 
from cosmic ray strikes discrepant by more than 3 a. 

Flux calibration was obtained through observations of the standard star HIP036096 
(V = 8.89 mag, type B2V), taken on the same night at similar airmass. The standard star 
spectrum was reduced in the same way as the science data, and the total flux found by summing 
4x4 lenslets (l.Sarcsec diameter). Normalizing through division by this spectrum corrected for 
the severe atmospheric absorption at > 1.33 /im. The total effective throughput was determined 
to be ~ 8 per cent on the sky for the middle of the J-band. 
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Fig. 1. — The full CIRPASS spectrum at the eenter of QSO image A (above) and at the center of the lens galaxy (below). 
The broad H/3 line at 13097 A is clear in the QSO spectrum and in the galaxy spectrum there is only a hint of it. It is also clear 
that the continuum is steeper in the QSO and the Fe II "bulge" is apparent in the QSO spectrum near 12300 A. The prominent 
line at 1.28 /im is Pa/3 from the calibration star. 

4. Data Analysis 
4.1. line strengths 

To extract the line fluxes we fit a local continuum spectrum plus lines to each "spixel" in the 
image (i.e. the spectrum from each fiber corresponding to a unique spatial position). Example 
spectra from two spixels are shown in Figure 1. We are interested in the relative integrated flux 
within a line which should not depend critically on modeling the line shapes in detail. (In fact 
it is possible that microlcnsing might affect the line profile in detail without greatly affecting 
the total flux in the line in which case a fit to a smoother line profile would be preferable. 
See Section 6.1.1.) We start by fitting a local continuum that is linear in wavelength and a 
simple first order Edgeworth expansion^ to the H/3 line in the nine spixels around image A, the 
brightest image. This fit is done only in the range 12658-13424 A (4706-4990 A in QSO rest 
frame) to avoid the Fe ii complex as much as possible. The shape of the H/3 line in image A is 
then used as a template for all the other spixels. The spectrum at each spixel in the array is fit 
to a continuum level and slope along with a line strength. The result is displayed in Figure 3. 
In fitting the line and continuum several sky lines are masked out and along with the calibration 
star's Pa/3 at 12820 A, see Figure 1. The shape and level of the Feu complex varies greatly 
between QSOs and is thus very difficult to subtract (see Boroson & Green 1992). In all cases 
however Feii contributes very little to the flux between 12658 A and the H/3 line. There are 
three broad Feu lines blueward of H/3 at AA4924, 5018, 5169 A which are not identified in our 



^The CIRPASS data reduction software is available from http://www.ast. cam.ac.uk/"'optics/cirpass/docs.htmI 

^The Edgeworth expansion is a generalization of the Gaussian distribution where each addition term effects a higher moment 
of the distribution. The first order expansion has a skew. 
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Fig. 2. — Map of the integrated flux between 11305.1 A and 13613.7 A(or 4203 Ato 5061 Ain the QSO's rest-frame). The 
image is in the CIRPASS spixcl coordinates which arc rotated clockwise 50° from 8 R .A. coordinates. The squares mark the 
apertures used to measure the QSO fluxes as discussed in § 4.2. The conventional image labels are shown next to each aperture. 
Note that image B (the right most) is not prominent above the lens galaxy. The pixels are 0.18" in the horizontal direction 
and 0.156" in the vertical (the diflference is due to the hexagonal packing of the CIRPASS lenslets). The two spots below the 
central image are caused by cosmic rays. 

spectra. The 4924 A line could conceivably cause us to overestimate the slope of the continuum 

and underestimate the strength of H/?. We can estimate the maximum effect this could have by 
noting that the height of this line is always below the height of the AA4450 — 4700 Fe ii bulge 
which we can measure. This contribution to the H/? strength is rather small. We include it in 
the H/3 flux errors instead of attempting to subtract it. 

The full-width-half-maxima (FWHM) of the H/3 Hues are 4100, 3500, 3600 and 
3600±100 km s~^ for images A, B, C and D respectively - all consistent with each other except 
image A. The skewnesses of the images are all small and consistent with each other. There is 
further discussion of the change in the shape of the H/3 between images in Section 6.1.1. 

The sky absorption near the 5007A and 4959A [O iii] lines is large and sky line subtraction 
makes any fitting difficult. The 4959A line is not detected anywhere in the field possibly 
because of confusion with the Fe ii A4924 line. The 5007A line is very close to a subtracted sky 
line to its redward side, and to its blueward the noise introduced by flux calibration and high 
sky absorption is too large to accurately fit a continuum. To avoid these difficulties we fit the 
continuum in the range 13278-13469A (independently of the continuum used in the H/? fit) then 
subtract this from the sum of the flux in 7 pixels (in wavelength) on top of the line (rest frame 
5006-5010A). This avoids the sky line. No fitting of the line shape is done. 

The maps of line strengths in Figure 3 show the four QSO images distinctly and residual 
flux from the lens galaxy is at a low level although there are bridges between 3 of the images 
in [O III] at a level of 25% of the maximum. These connections go through the center of the 
lens galaxy and no arc connecting images is detected. The [O iii] NLR is consistent with being 
point-like at our resolution, ~ 0.6" in contrast to the C iii] A1909 NLR observed by Mediavilla 
et al. (1998) which shows an arc connecting images A, D and B. 

To evaluate the statistical errors in our line strengths we take the part of the IFU field 
that is well away from the lens system where we expect no line emission and go through the 
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Fig. 3. — A map of H/3 and [OIII] 5007 at the redshift of the QSO with the continuum subtracted spixel-by-spixel. The four 
QSO images are more distinct than in the total flux, Figure 2. Image B is clearly apparent here and the lens galaxy is not. 



same continuum and line fitting procedure. We find that the average line strengths in 1036 
such spixels is consistent with zero showing that there is no significant bias in our line fitting 
algorithm. The variance in the line strengths among these spixels is used as the statistical noise. 
There should be no significant signal-dependent noise since the data is dark current limited. 



4.2. measuring magnification ratios 

To find the total flux within a line coming from an image we must choose an aperture 
around each image. We use two choices for apertures. First we use square boxes five half-lenses 
wide (0.90" x 0.78") centered on the QSO image positions. This is close to the expected size of 
the PSF. The peak in the emission is found to be the center of image A and the other image 
positions are set using the high precision HST relative image positions (Kochanek et al. 2000). 
These are shown in Figures 2 and 3. To make certain that we are capturing all the fiux from 
each image we also extend the apertures as shown in the figures. The differential magnifications 
arc; given in Table 1 along with the separate NLR and BLR fiux ratios. The errors given are the 
statistical errors discussed above. There is little change to the DMRs when the apertures are 
extended further without overlapping each other signifying that leakage of flux outside of the 
apertures is not significant. 

Because the DMRs depend on which image is chosen to normalize them the important 
features to look for are the spread in the DMRs, how consistent they are with each other, 
and whether there is one outlier indicating that one particular image is being effected. The 
normalizing image's DMR - A = - should be included in this comparison. In table 1 image A 
is used to normalize the DMRs. For the other images the DMRs are negative and inconsistent 
with zero, but they are roughly consistent with each other. This indicates that the BLR in 
image A is anomalously bright. The simple magnification ratios in the BLR and those in NLR 
are also consistent with each other except those including image A which indicates that the 
differential extinction is not large. 
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Image 


B/A 


C/A 


D/A 


small apertures 


^NL/BL 


-0.82 ±0.06 


-0.89 ±0.06 


-0.93 ±0.05 




Anl 


-0.23 ±0.06 


-0.14 ±0.06 


0.09 ±0.05 




Abl 


-1.06 ±0.02 


-1.03 ±0.02 


-0.84 ±0.01 


extended apertures 


Anl/bl 


-0.74 ±0.05 


-0.87 ±0.05 


-0.81 ±0.05 




Anl 


-0.31 ±0.05 


-0.23 ±0.04 


-0.21 ±0.04 




Abl 


-1.05 ±0.02 


-1.11 ±0.02 


-1.02 ±0.01 



Table 1: Differential Magnification Ratios and Magnification ratios in magnitudes: These are normalized so that 
"^NL/BL ~ ^' Negative values indicate that image A is brighter in the simple ratios and that the BLR of image A is brighter 
in the DMRs. 
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Fig. 4. - Here are the DMR with respect to the radio and mid-IR fluxes normalized to image D. The NLR DMRs (i.e. 
^iiT I J- — / J- and A' , — A^?, ,,„) are marked by diamonds and the BLR DMRs by squares. The radio is at 3.6 cm 

NL/radio NL/radio NL/IR NL/IR-^ J 

and the mid-IR is a combination of 8.9 pxa. and 11.7 /im observations. The published \-a error bars are shown. 



4.3. Continuum 

It would be useful to compare the BLR and NLR fluxes with the continuum in our data. 
Extracting the QSO continuum flux requires subtracting the lens galaxy contribution and all the 
contributions from lines. An attempt was made to model the HST NICMOS image of the lens 
galaxy and degrade it to the PSF of the Gemini/CIRPASS data. We concluded that because of 
wind-shake, the PSF of our data was not well enough characterized to accomplish this to the 
accuracy we would like for this paper. 



5. comparison with radio and mid- infrared ratios 

Two of the goals of this work are to eliminate the uncertainty inherent in radio flux ratios 
due to scattering by ionized gas and to eliminate differential extinction uncertainties inherent in 
optical or near infrared observations. However, it will be shown that in this case these effects are 
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narrow line region broad line region 

image B/A C/A D/A B/A C/A D/A 

mid-IR/radio -0.42 ±0.11 0.35 ±0.15 -0.16 ±0.15 -1.16±0.10 -0.53 ±0.15 -0.97 ±0.14 

Table 2: The differential magnification ratios comparing the combined radio and mid-IR magnification ratio measurements 
to the NLR and BLR measurements. This is — ^nl and A^g^^jj^ — Abl- The NLR and BLR errors are 

not doubled here. The NLR B to A and C to A DMRs disagree by 0.77 mag. 



small. To do this and gain more information about substructure in the lens we construct DMRs 
by combining other observations with the NLR and BLR measurements. Since the sizes of the 
radio and mid-infrared emission regions are believed to be intermediate to those of the BLR and 
NLR this can give us additional information about the structure causing the anomalous DMR 
at the expense of these additional uncertainties. 

The radio fluxes in Q2237+0305 have been measured using VLA (Very Large Array) 
by Falco et al. (1996) at 3.6 cm, and Agol, Jones, & Blacs (2000) have observed it in the 
mid-infrared, 8.9 /um and 11.7 jim. Figure 4 shows the DMRs with respect to the radio and 
mid-IR measurements for both the NLR and the BLR. The uncertainties in these measurements 
are considerably larger than those in our line measurements, but important conclusions can 
be drawn from them. The BLR in image A is anomalously bright with respect to the other 
measurements while the NLR in image A is consistent with being magnified to the same 
extent as both the radio and mid-IR. The consistence in the mid-IR and radio ratios is further 
demonstrated in the left panel of Figure 5. This suggests that the radio and the mid-IR emission 
regions are being Icnscd by the same structure and thus arc similar in size which is in agreement 
with theoretical expectations, but, to our knowledge, has never been measured directly. It is 
also seen in Figure 4 and more clearly in the right panel of Figure 5 that both the radio and the 
mid-IR do not agree with the NLR - the image B NLR is relatively dim and the image C NLR 
is relatively bright. The difference between the largest DMR and the smallest DMR, the spread, 
is 0.77 mag. There is no clear outlier in the DMRs suggesting that the spread is not the result 
of an anomaly in a single image. Table 2 lists the DMRs comparing the combined radio/mid-IR 
with the NLR and BLR data. 

The differential colors for the images of Q2237+0305 arc in order AE{B-V) = -0.10±0.03, 
-0.17 ± 0.03, 0.01 ± 0.03 and (Falco et al. 1999). This implies that extinction would make 
the NLR in image B relatively dimmer than the others which is the opposite of what is found. 
Also, the expected magnitude of the extinction is relatively small in comparison to the DMRs 
observed - A{1.3iJ,m)/E{B — V) = 0.75 mag for the Milky Way giving a change in the DMR 
between images A and B of 0.05 mag. This does not exclude the possibility that the extinction 
is very patchy so that the continuum extinction is not appropriate for the NLR, but the small 
differences between the colors of the images argue against this being a large effect. 

The simplest interpretation of the anomalous DMRs in Figure 5 is that the mid-IR and radio 
emission regions are larger than the BLR and whatever is causing the extra magnification of the 
BLR in image A is too small to significantly change the magnification of the mid-IR, radio or 
NLR. Then there is some larger structures that are causing the NLR ratios to be incompatible 
with the mid-IR and radio ratios. 
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Fig. 5. — The likelihood functions for differential magnification ratios. In tlie left panel is the DMR between the mid-IR and 
radio ratios with respect to image A (DMR=0 means the ratio of mid-IR to radio is the same as in image A) using the published 
errors. All three DMRs are compatible with zero. In the right panel it is assumed that the magnification ratios in mid-IR and 
radio are the same and they combined into one set of measurements. The likelihood function between this measurement and 
the NLR measurements are plotted. The NLR errors are twice those reported in Table 1 to be conservative. The DMRs of 
image C and D are consistent with zero, but image B has a low DMR meaning that the radio/mid-IR is anomalously bright 
with respect to the NLR in this image. 

6. Interpretation 

Tliere are tliree possible factors contributing to the anomalous differential magnification 
ratios we observe. Microlensing of the BLR by stars in the lens galaxy could be changing 
its magnification ratios relative to those of the NLR, radio and mid-IR. This is discussed in 
Section 6.1. The size of the NLR could be big enough with respect to the host lens that its 
magnification differs from that of the other emission regions even without what would normally 
be classified as substructure. This is ruled out in Section 6.2. Finally, larger scale substructure 
(possibly made of CDM) either in the lens galaxy or along the line of sight is causing the 
mismatch between the NLR and radio/mid-IR magnification ratios. This is termed millilensing 
and is investigated in Section 6.3. 

An additional consideration that can be immediately ruled out is that the time delay between 
images is longer than the typical timescalc for variations in the source. Any reasonable lens 
model for Q2237+0305 gives time delays that are very short, typically less than a day. Recently, 
a tentative measurement of the time delay has been made using X-ray observations yielding 
2.7^0 9 hours between image A and B (Dai et al. 2003). This is well below the timescale on 
which the QSO is expected to vary in any of the wave-bands discussed in this paper. 
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6.1. Microlensing 

Q2237+0305 is exceptional among QSO lens systems in that its lens galaxy is very close to 
us (z=0.03). This makes the BLR particular susceptible to microlensing for two reasons. First, 
the Einstein radius of the host galaxy in physical units is roughly oc DigDi/Ds which reaches 
its maximum close to half way to the source. For this reason the images form closer to the 
center of the galaxy in Q2237+0305 where the star density is particularly high. In addition, the 
size of a star's Einstein radius relative to the size of the source goes as y/Di^D^/Di and so is 
exceptionally large for Q2237+0305. There is ample evidence that the QSO's optical continuum 
is being microlensing in image B of Q2237+0305 (Irwin et al. 1989; Ostensen et al. 1996; 
Corrigan et al. 1991; Wozniak et al. 2000; Wyithc et al. 2000b). Monitoring has shown that in 
the optical, image B went from being the second brightest image (hence the name) to being tied 
for the dimmest image over a two year period. We find image B to be dimmest in the NLR and 
the BLR. This gives some hint that the BLR magnification is correlated with the microlensing 
of the optical continuum. 

Further evidence of microlensing of the BLR is in Figure 4 where our data is compared 
to other data. The mid-infrared emission region is believed to be not much larger than the 
BLR. The lack of strong anomalies in the; mid-IR/radio DMRs suggests that the substructure 
is very small. In addition, the large anomaly in the BLR/NLR image A DMR implies compact 
and numerous substructures are responsible. Microlensing by stars seems like the most 
straightforward explanation. 

The microlensing interpretation is also consistent with theoretical expectations for this 
system. An upper limit on the standard deviation of the magnification ratio caused by 
microlensing for a finite sized source is Sm < 3|Ye^-| ^4r^ (Refsdal & Stabell 1997, 1991) 
where 6e{M^) is the Einstein radius for a star of mass M* = (M^)/(M) , the averaging 
is done over the distribution of star masses, and is the dimensionlcss surface density 
in stars. The surface density in smoothly distributed dark matter is k — k*. In this case 
6'e(M*) = 0.089 pc/Ds^/M^/ M© = 3.63 x 10-<^^M,/ Mq arcsec. Trott & Webster (2002) 
have modeled the mass distribution of Q 2237+0305 using the QSO image positions and a variety 
of direct observations of the lens galaxy. Their best fit model gives k — ~ 0.05 and ~ 0.04 
at the radius of the images. For a realistic value of M« = 0.5 Mq this gives 6m ^ 0.04 pc/i?NLR 
where i?NLR is the physical size of the BLR. 

The size of the BLR in some QSOs and Seyfcrt galaxies can be estimated using reverberation 
mapping. Combining the radius-luminosity relation and broad line width luminosity relation of 
Kaspi et al. (2000) gives i?NLR — 0.03(4108 km s~^/t'FWHM)^'^^ pc although there is significant 
scatter about this relation. With the H/3 line widths reported in §4 this gives i?NLR ~ 0.05 pc, 
small enough that the the observed magnification of image A can be explained by microlensing. 

The time scale for changes in the microlensing magnification is set by the size of the BLR and 
the velocity of the stars in the galaxy relative to the image. The BLR is of order a light-month 
in size and typical speeds for the galaxy and rotation of the its disk are ^ 200 km s^^. This 
gives a time scale of ^ 100 yr. This is in contrast to microlensing of the visible continuum where 
the time scale is about 100 times smaller (Wyithe et al. 2000b, 2000a). 
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Fig. 6. — Comparison of the H/3 line in image A (dark curve) to that in image B (light curve). The line in image A is slightly 
wider than the line in image B. This could be caused by a blending of the emission from the NLR and the BLR combined with 
differing magnifications or by microlensing of kinetically distinct regions of the BLR. 

6.1.1. H(3 Line Profiles 

Several groups have investigated the possibihty of detecting microlensing and learning 
something about the structure of the BLR through distortions in the profile of the broad lines 

(Nemiroff 1988; Schneider & Wambsganss 1990; Abajas ct al. 2002). If the BLR has a coherent, 
large scale velocity structure (such as rotation or uniform infall) microlensing could change the 
shape of a broad line in the image that is being microlensed. This effect comes about because 
the magnification can vary across the image of the source and the emission region for a narrow 
range of wavelength can be much smaller than the whole NLR. It was generally thought that if 
microlensing would have an insignificant effect on the total magnification of the BLR of a bright 
QSO. 

Wc find that the HZ? line is wider in image A with respect to the other images. Figure 6 
shows the line in image A and B with the continuum subtracted and normalized so they have the 
same height. The change is not large and there is no significant evidence that the asymmetry of 
the lines are different in the different images. In appendix A an upper limit to the contribution 
a wavelength dependent component to the magnification is derived. More than half of the 
anomalous magnification in image A must be caused by a magnification that is uniform over the 
velocity structure of the H/? emission region. 

The extra width of the image A II/3 line can be interpreted as either caused by the 
microlensing of a small scale velocity structure in the BLR or as a sign that there is a 
narrow-line component to the II/3 that is not being magnified as strongly in image A as the 
broad-line component. The microlensing explanation would not in general produce a symmetric 
broadening like the one we see, but it is possible that it would. We conclude that there is no 
strong evidence for coherent large scale motion in the H/3 BLR, but that this possibility cannot 
be ruled out. 

Filippenko (1989) found that the broad Mgll A2798 line had a slightly asymmetric distortion 

in its profile between the different QSO images of Q2237+0305 which could be attributed to 
microlensing. It is known that the BLR for high ionization states is significantly smaller than 
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Fig. 7. A model of Q2237+0305 with the regions used for measuring the magnification ratios shown. In this case the source 
has a Gaussian profile of size Ro.i = 526 pc (full width for 10% peak surface brightness). The lowest contour is at 1% of the 
peak surface brightness and they go up in intervals of 10%. It can be seen that differences in the ^nl/bl'^ from zero are not 
caused by loses outside of the integration regions. 

the H/J BLR (see Peterson & Wandel 1999, for an illustration of this) thus more susceptible to 
microlensing and possibly more coherent in its velocity structure. 

6.2. NLR size effects 

There are two ways in which the size of the NLR might change the NL/BL ratio between 
images without substructure. First, if the effective size of the NLR is close to the size of the 
aperture used to measure the flux the NLR/BLR ratio will be affected. Because the sizes of the 
NLR arc different in each image it is possible some of the flux cotild leak out of the aperture 
causing the more magnified images to have lower NLR to BLR ratios. This possibility can be 
eliminated by increasing the size of the apertures and seeing in the NLR fluxes go down as is 
the case for our extended apertures. 

The second possibility is that the NLR is large enough that the magnification cannot be 
considered uniform over the whole image and thus its total magnification will differ from that 
of the smaller BLR. To address this issue we model the lens and the NLR for Q2237+0305 and 
measure the ratios in mock data. The lens model consists of a elliptical halo with a power-law 
radial profile, a exponential disk and an external shear. The model reproduces the image 
positions and the radio magnification ratios. The same apertures are used in the simulations 
and with the data. Figure 7 shows an example of one such simulation. We use two models for 
the surface brightness profile of the NLR - an exponential and a Gaussian. The Gaussian is 
meant to represent the case of a NLR with a sharp edge while the exponential is more diffuse. 
The profile of the NLR of NGC 4151, probably the best studied example in a Seyfert galaxy, is 
roughly exponential although it does show significant irregularities and asymmetries (Robinson 
et al. 1994). 

The size of the NLR in C iii] A1909 was inferred from modeling the arc connecting images 
A, B and D by Mediavilla et al. (1998). They estimate the size of the BLR in this wavelength 
to be 340 — 570h~^ pc where this is the total extent of the observed emission. Direct HST 
measurements of QSOs in [O iii] indicate that the the NLR could be a few kpc in total extent 
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Fig. 8. Tlic DMRs witli respect to image B are plotted as a function of source size, -Ro.i- These are calculated using the lens 
model and the extended apertures discussed in the text. The source profile is Gaussian in this example. The hashed regions 
are within la of the measured values. 

(Bennert et al. 2002). 

Figure 8 shows the results of these simulations along with the observed DMRs. The size of 
the source is measured by i?o.i which is the radius at which the surface density drops below 
10%. For comparison 1" = 24. 5/1^5^ kpc on the source plane and SSbh^^ pc on the lens plane. 
For a Gaussian source profile the DMR does not diff'er greatly from one (or zero in magnitudes) 
if i?o.i ~ 4 kpc. For a exponential profile the same is true for Rq.i ^ 2 kpc. And when the NLR 
is large enough to change the DMR it is in the opposite direction to the observed A to B DMR. 
We conclude that the finite size of the NLR could account for the low (but not very low) C to B 
and D to B DMRs, but not for the high A to B DMR. 

6.3. Millilensing 

The disagreement between the NLR and radio/mid-IR magnification ratios is most easily 
explained by the presents of substructure with a mass m ^ Mq . We shall sec that the quantity 
of substructure required to make this discrepancy probable makes substructure made of CDM 
a more likely explanation than substructure composed of ordinary gas and stars such as dwarf 
galaxies or globular clusters. To determine the level and properties of the substructure that are 
compatible the measured DMRs we carry out extensive lensing simulations. 

6.3.1. simulations 

For the results that follow, the magnification distributions were calculated using an adaptive 
grid code. The strategy is to first calculate the image positions, through the lens equation. 
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on a coarse grid over the image plane. The deflection angle is found by summing over all the 
subclumps in this initial region. The grid is then refined to cover smaller regions surrounding 
each of the images. The grid is made finer when the image begins to cover a large fraction of 
the gridded region. This process is repeated until enough grid points are within the image to 
calculate its area to better than 1% accuracy. The magnification is then the ratio between this 
area and the original area of the unlensed source. 

If the substructure is massive ( ^ 10^ Mq) it can change the position of the images enough 
that the contribution of the host lens to the image magnification can change. In addition, a 
single one of these large substructures can affect the magnification of more than one image. 
For these reasons we simulate the whole lens (host lens plus substructure) at once. In this 
way we can measure the positions of the images which become significant constraints on the 
substructure when its mass is large. 

CDM simulations predict that most of the substructure at a projected galactic radius of a 
few kpc will be at a three dimensional radius of a few tens of kpc (Zentner & Bullock 2003). The 
substructure will then be distributed fairly uniformly on the scale of the host's Einstein radius. 
For this reason we do simulations with a constant number density of substructures. We do not 
characterize the simulations by the mass fraction in substructure because this can vary by a 
factor of several between the different images owing to the differences in density at small radii. 
The substructures are modeled as singular isothermal spheres (SISs) tidally truncated as if they 
are at a distance of 30 kpc from the center of an isothermal galaxy. The lensing is not critically 
dependent on the details of this truncation. All the substructures are of the same mass. In a 
more theoretically motivated model there would be a distribution of substructure masses, but in 
this paper we are seeking to put empirical constraints on the substructure mass scale. 

6.3.2. radial profile degeneracy 

There is a degeneracy between the level of substructure and the radial profile of the host 
lens as discussed in Metcalf (2003) . The effect of changing the radial profile is investigated by 
adjusting the host lens model. We start with an elliptical lens model with an k{p) oc (where 
p in coordinates that are aligned with the axis of the lens is = a;f + a;2/(l — e)^) profile that 
fits the image positions very well. This is known as a singular isothermal ellipsoid (SIE) lens 
model. We then add a uniform surface density, a, to the lens and adjust the normalization of 
the ellipsoidal component so that the image positions do not change. This is always possible 
and has no effect on the magnification ratios, but it does change the total magnification such 
that flatter profiles have higher magnifications and it changes the time delays such that flatter 
profiles have shorter time delays. The total magnification is unobservable and the time delay 
has not been measured for Q2237+0305. These degenerate models are labeled by their effective 
logarithmic slope, ries, which is related to the the added surface density by ries = {a — 1). For 
further discussion see Metcalf (2003). 

For each choice of substructure properties and host lens model we run simulations with three 
different source sizes - 10 pc which is appropriate for the radio and mid-IR emission regions and 
for the NLR we use 200 pc and 400 pc for all. This size should be viewed as an effective size for 
the purposes of lensing not the full extent of the NLR which must be larger. 
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Fig. 9. — The probability that the spread in the DMRs, S, will be greater than the observed 0.77 mag, P(<S > 
0.77 I Kgubi ™off I ™)i with substructure mass m = 10'* Mq. The different curves are for host lens models with different radial 
profiles (see Section 6.3.2). The horizontal dotted line marks the 5% level for reference. To convert the surface densities into 
mass fractions in substructure see Table 3. 
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Table 3: The surface densities at each of the images in units of Scrit (= 9-8 x 10'^ Mq pc^^ in the Ho = 70 km s^-'^ Mpc~^, 
Ha = 0.7, fimatter = 0.3 cosmology. These values differ from those in the lens model of Schmidt, Webster, & Lewis (1998) at 
the 10% level. The rieff = ~1 model is the singular isothermal ellipsoid lens model. 



6. 3. 3. results 

We find that the data are best suited to putting a lower limit on the surface density of 
substructure as a function of the substructure mass and the host lens profile; this is the most 
robust and conservative way to treat the data. To quantify this we use the spread statistic. Wc 
define the spread, S, as the difference between the largest DMR and the smallest, including the 
normalization image's DMR (A = 0). This statistic is clearly independent of which image is used 
to normalize the DMRs. In the absence of substructure and noise <S = 0. As shown in Section 5 
the measured spread is S = 0.77 mag. Wc calculate the probability of getting an observed 
spread as large as this or larger given the level of substructure and the noise, P{S > 0.77). This 
is done by Monte Carlo, using the simulations to predict the flux of each image and then adding 
random noise. At least 2,000 lensing simulations are calculated for each choice of substructure 
parameters and for each simulation 500 realizations of the noise arc made. The results do not 
change when these numbers are increased. To be conservative we use twice the NLR flux errors 
derived from the data and reported in Table 1. 

Figures 9 through 13 show the probability P{S > 0.77) for different substructure masses. 
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Fig. 10. — Same as in Figure 9 only with substructure mass m = 10^ Mq. Note that the scale is different and the probabilities 
axe generally larger. 
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Table 4: Minimum substructure surface densities required to have a better than 5% chance of observing a DMR spread as 
large as the one observed. The cases marked "> 0.06" did not reach this limit within the range of surface densities tested, those 
marked "— " appear as though they cannot reach this limit for any Kgub ^^'^ those that are blank where not tested because the 
images tended to merge in too many cases. 



Figure 9 is for a mass of 10* Mq. In this case the substructures are not large enough to change 
the magnification of the NLR and only rarely can significantly change the magnification of the 
10 pc source representing the radio and mid-IR sources. The probability is correspondingly 
low with none of the models going above 5% probability for Kguh < 0.06. We consider this 
low probability unacceptable thus ruling out m = 10* Mq substructure with Kgub < 0.06 as an 
explanation. To translate these values into mass fractions Table 3 gives the surface densities at 
each image in the different models. 

Figure 10 shows the probabilities for mass 10^ Mq. The probability is larger here than for 
10* Mq, but still not significant unless the host halo is flatter than an SIE. The UeS = — 1 
model, the most common assumption in lensing, is ruled out for the range of Ksub tested. 
Our maximum surface density of Ksuh = 0.06 is still very large compared to the some CDM 
predictions of less than 0.1% at this mass scale (Zentner & Bullock 2003). The ries = —0.4 case 
reaches our thresh-hold of 5% at Hguh = 0.047 making this the lower limit in this case. 

A mass of 10^ Mq is investigated in Figure 11. In this case for substructure surface densities 
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Fig. 11. — Same as in Figure 10 only with substructure mass m = 10'' Mq. 

Ksub > 0.029 the observed DMR discrepancy occurs more than 5% of the time in all the host 
models tested. At the 10% level Ksub > 0.058 in the most common UeS = —1 model. For flatter 
models this lower limit is reduced significantly. These are more reasonable mass fractions since 
Ksub = 0.02 corresponds to /sub — 0.05 at image A in the UeS = — 1 model and less in the flatter 
models. However, at this mass scale this is still rather large compared to the current CDM 
predictions. 

Figure 12 shows P{S > 0.77) for substructure of mass 10^ Mq. In this case the substructure 
is big enough to occasionally shift the image positions by a few tenths of an arcsec. Lens 
models for Q2237+0305 using the observed distribution of stars have been very successful at 
reproducing the image positions (Trott & Webster 2002; Schmidt et al. 1998). For this reason 
any substructure cannot change the image position by very much. In Figure 12 limits on the 
allowed shift in image positions are imposed to express this additional constraint. We adopt 
a conservative limit of 0'.'2 as a test. Results with and without the image shift constraint are 
plotted in Figure 12 and it is seen that this extra constraint does not change the lower limits on 
Ksuh- The most striking result displayed in Figure 12 is that the rieff = — 1 is unlikely to produce 
the observations for any substructure surface density that is reasonable from the prospective of 
current CDM model predictions (^Csub — 0.06 would be 15% of the mass at image A). In fact, 
P{S > 0.77) does not go above 15% within the range of Kgub investigated unless riefi > —0.4. In 
some cases we can put an upper limit on Kgub and they are reported in Table 4. 

Figure 13 shows P{S > 0.77) substructure of mass 10^ Mq. Here all the models are ruled 
out at the 90% level and the only way to get above the 95% level is to abandon a limitation 
on the shift of the image position which we think is not realistic. High mass substructure 
is not compatible with the observed DMRs because it is not effective in changing the small 
and large scale magnification ratios independently - when it magnifies (or demagnifies) the 
radio and mid-IR images it also magnifies (or demagnifies) the NLR image. This is further 
conformation that anomalous DMRs must be the results of small scale structure and not a large 
scale distortion of the host lens. In addition, the 10^ Mq substructure tends to displace the 
images on the scale of several tenths of an arcsec although this mass range is ruled out even 
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Fig. 12. — Same as in Figure 11 only the substructure mass is 10^ Mq and two limits on the shift of the image positions are 
shown. For each n^g the lowest curve is for a 0'.'2 limit and the highest is for no limit. For the n^ff = —1, —0.8 and —0.6 the 
0'.'2 limit is indistinguishable from no limit. Note that scale in this plot is not the same as in Figure 11. 



without this constraint. 

We have done these calculations with source sizes of 200 pc and 400 pc representing the 
NLR. Changing the size makes only small changes to P(5 > 0.77) at large mass and large rieff. 
These changes do not significantly change the conclusions. For Figures 9 through 13 the NLR 
source size is set to 200 pc. The lower limits on Kgub for all the models tested are shown in 
table 4. 

The error estimates that have been derived for our observations and those published with 
the mid-IR and radio observations are crucial for the estimation of the surface density bound. If 
the errors are underestimated the Kguh lower bound should be lower. For example, if we increase 
the radio errors of Falco et al. (1996) by half the Ksub lower limit in the tIcs = —1, m = 10^ Mq 
model goes from 0.029 to 0.021. Increasing the mid-IR errors (Agol et al. 2000) in this way 
changes this limit to 0.016. On the other hand if the empirically derived errors on our NLR 
measurements are used instead of doubling them the limit is increased to 0.055. 

To squeeze more out of the data we can also calculate likelihood functions for the substructure 
parameters although this is somewhat less robust then the P{S > 0.77) analysis done above. 
The likelihood function for set of parameters {Wsubi nes, m} is given by 

£(Ksub, ries, = y c^{DMR} P^ub ({OMR}, {x} | K^ub, J^eff , m) (2) 

xPob ({DMR„6}, {x„6} I {DMR}, {x}) 

where Pot expresses the instrumental uncertainties which is taken to be a normal distribution. 
Uniform prior distributions have been assumed for the parameters. Again the integral is done 

by Monte Carlo. 

Figure 14 shows the likelihoods as functions of substructure mass in several ranges of 
substructure surface density. Plotted this way it is clear that the data favors a substructure 
mass of ~ 10^ Mq for fixed Ksub- Here the errors in the NLR magnification ratios have been 
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Fig. 13. — Same as in Figure 12 except the substructure mass is 10** Mq. None of the models reached above 0.065 probability 
that the DMRs spread will be greater than 0.77 mag. 



set to twice those in Table 1 to be conservative. The hkelihood functions with the measured 
statistical errors are shown in Figure 15. The shape of the likelihood function is very similar, 
but the extreme masses are more strongly ruled out. 

In summary, the data puts a lower limit on the amount of substructure that is present in 
Q2237+0305. Table 4 gives these limits for the ranges of parameters tested. Given the present 
CDM simulations these limits seems uncomfortably large. With the expected substructure level 
of < 1% of the surface density in subclumps of mass less than 10^ Mq the host lens profile as 
steep as k{p) (x p^^ is unacceptable. We also find that a substructure mass scale of ^ 10^ M0 is 
most effective at changing the observed DMRs. Substructure of mass 10** Mq is too small and 
10^ Mq is too large. The details of the lower mass limit is dependent on our assumption that 
the radio and mid-IR emission regions are 10 pc in radius, however to the precision we are now 
concerned with this should not be a major factor. 

It has been assumed here that the substructure is within or near the lens galaxy. It is 
possible that the responsible objects are in intergalactic space either in front of or behind the 
lens. Within the CDM model this is less probable, but not improbable. The above constraints 
can be crudely converted to a redshift z by scaling the substructure mass and surface density 
by a factor of Ecrit(-2^s) -Zi)/^crit(-2s) z) = ^ j^jjjj'Ij^y ^ ■ This factor reaches a minimum of 0.6 at 

.2=1.1. 



6.3.4- The contribution of known objects 

There are known objects in our galaxies and others which could conceivably be causing the 
observed anomalies in the DMRs. One possibility is globular clusters. However, the total mass 
in globular clusters around the Milky Way is only ~ 10'' — 10^ Mq so their surface density is 
two order of magnitude below the lower limits reported here. 

Another contribution is from Giant Molecular Clouds (GMCs) of mass ~ 10^ Mq which 
are known to contain most of the H2 gas mass in the Milky Way. The surface density of H2 
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Fig. 14. — These are the HkeUhood functions for four ranges of substructure surface density as marked in each panch The 
curves are marked in the same way as in Figures 9 through 13 — solid: riefi = —1, dotted: n^ff = —0.8, dashed: n^g = —0.6 and 
dot-dashed: ngfj = —0.4. The hmit in the acceptable shift in the image positions is 0'.'2. The normalization is arbitrary, but all 
the models are normalized in the same way so that they can be compared between panels. A substructure mass of ~ 10^ Mq 
is clearly favored in all cases. 



at galactic radius ~ 1 kpc (where the images in Q2237+0503 form) is only a few Mq pc~^, 
well below our lower limits (Blitz 1997). The H2 density in M31 no more than ~ 5 Mq pc~^ 
anywhere (Loinard et al. 1999). However, in some galaxies the peak H2 density does exceed our 
lower limit (Heifer et al. 2003). The H2 density is truncated at an inner radius of several kpc. 
This is believed to be caused by the galactic bar. The same phenomena is seen in other barred 
spirals. In Q2237-I-0503 the lensing is almost entirely caused by the bulge and bar of the galaxy 
through which the images pass (Trott & Webster 2002) . For this reason wc expect the density of 
GMCs to be well below our limit. In addition, GMCs are not very centrally concentrated, they 
are generally held up by turbulent pressure and have many, much smaller mass, condensations 
sprinkled throughout them. For this reason GMCs make inefficient lenses as compared to the 
SIS model used in Section 6.3.1 through 6.3.3. The GMCs would need more density by a factor 
of a few to reproduce reproduce the lensing observations. 

Another argument for a small surface density in GMCs is the extinction. By adopting an 
upper limit on the absolute luminosity of the QSO of Mb > —29 and an upper limit of 20 for 
the lens magnification (the SIE+shear model predicts 16) an upper limit for the extinction of 
Ay < 2.7 mag can be derived (Falco et al. 1999). The extinction in the Milky Way is roughly 
related to the hydrogen column density through N{H I) + 2N{H2) — 13 Ay Mq pc~^ (Savage 
et al. 1977). Conservatively assuming all the hydrogen is in molecular form this gives an upper 
limit of N{H2) < 18 Mq pc~^ which is well below our lower limit on substructure. Falco et al. 
(1999) have argued that the above relation between hydrogen column density and extinction 
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Fig. 15. — The same as in Figure 14 only the errors for the NLR magnification ratios reported in Table 1 are used instead of 
doubling them to account for unknown systematic errors. The normalization is arbitrary and different from that in Figure 14. 



does not hold for some of the lens galaxies they studied in the sense that it over estimates the 
column density which makes our upper limit even more conservative. 

For the above reasons we do not think GMCs are a likely explanation for the DMR anomalies, 
but they are probably the largest potential "background" and we plan to investigate their lensing 
properties more thoroughly in the future. For lenses with elliptical galaxies the majority of 
QSO lenses - GMCs are much less likely to be a source of lensing substructure. Analyzing more 
lens systems can definitively rule of GMCs. 



7. Discussion 

We have demonstrated the spectroscopic gravitational lensing method and shown the 
importance of having high quality data in a large range of wavelength. Using the CIRPASS 
instrument the magnification ratios of the QSO's narrow lines region and broad line region were 
measured. It was found that these do not agree with each other or with the published radio and 
mid-IR magnification ratios. The disagreement between the BLR magnification ratios and the 
radio/mid- IR magnification ratios is interpreted as microlensing of the BLR. This implies that 
the mid-IR emission region is larger than the BLR and the BLR is 0.1 pc in size. We find a 
small difference in the shape of the H/3 line in image A when compared to the other images. This 
difference is too small and symmetric to be considered evidence for rotation or large scale infall 
in the H/3 emission region. The disagreement between the radio/mid-IR ratios and the NLR 
ratios is interpreted as a signature of substructure. Lensing simulations are performed to obtain 
a lower limit on the amount and mass of substructure that is required to cause this discrepancy. 
The substructure content is degenerate with the radial profile of the host lens, but if the 
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expectations of the CDM model are taken into account certain radial profiles and substructure 
surface densities can be ruled out. The substructure mass scale as large as 10* M0 and 10^ Mq 
is disfavored while 10^ Mq is too small given our assumptions about the radio/mid-IR source 
size. The standard elliptical singular isothermal lens mass profile is disfavored in comparison to 
a more flattened profile with the same substructure content. 

Although the anomalous differential magnification ratios in Q2237+0305 are very unlikely to 
be caused by a substructure as large as 10* Mq this does not mean that the mass function of 
substructures does not extend up to this mass scale. Large mass substructures could dominate 
the total mass in substructure and yet be very few in number so that one would expect many 
lenses to be unaffected by them; in other words the shot noise from lens to lens is expected to 
be large at high masses. If the responsible object is intergalactic and unassociated with the lens 
galaxy the mass bound is different, but in this case the upper mass bound is more likely to go 
down then up because the lens galaxy is unusually close to us and not near the minimum of 

It should be mentioned that flattening the host lens' radial mass distribution generally 
reduces the Hubble parameter as measured by the time delays. The X-ray measurement of the 
time delay (2.71° g hours between image A and B) is tentative (Dai et al. 2003), but it does 
indicate that Q2237-t-0305 has a near surface density profile if the HST Key Project's value 
for the Hubble parameter is used (72 ± 8 km s~^ Mpc~ ; Preedman et al. 2001). The fiducial 
SIE+shear model used here gives a time delay of 3.3 hours and the delay is proportional to 
n~g as defined in section 6.3.2. In general the measured time delays in other lenses lead to a 
conclusion that either the Hubble parameter is smaller than the above value or the profile is 
steeper than r^^. Kochanek (2003) has highlighted this as a problem for the CDM model. This 
is an unresolved and intriguing problem which becomes even more so with our results since 
the steeper the radial profile the more substructure is needed. It could be that in looking for 
substructure to confirm CDM we have found too much. 

Clciarly more lens systems need to be studied in this way to put stronger constraints on dark 
matter substructure and eliminate systematics. The prospects for improvement are good in this 
respect since Q2237-I-0305 is not an ideal system for this work - the [O iii] lines are in a region 
of high atmospheric absorption and because of the proximity of the lens galaxy microlensing 
is enhanced. The situation can be greatly improved with just a few more systems observed in 
narrow lines and the mid-IR and/or radio. One of the most important systematic uncertainties 
we encountered is how to construct the apertures around each of the images for measuring the 
magnification ratios. The improved spatial resolution that would come with using a space-based 
observations (or with adaptive optics-enhanced spectroscopy from the ground) would be of great 
benefit in reducing this uncertainty. 

A. Contribution of Wavelength Dependent Magnificat ion 

It is possible that the magnification of the BLR is not uniform as would probably be the 
case if microlensing is important and the BLR size is of order the Einstein radius of the stars 
in the lens galaxy. This possibility can be tested if the BLR has a coherent large scale velocity 
structure as in the case of a rotating disk or spherical infall. In this case a small scale variation in 
the magnification would distort the the broad line in one or more of the images. This effect was 
proposed by Nemiroff (1988). The contribution of such a wavelength dependent magnification 
to the differential magnification ratio can be assessed by comparing the line profiles from the 
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Fig. 16. The line profile distortion parameter, equation Al, between image A and the other images. The central peak is 
0.3 mag which indicates that the wavelength independent contribution to the NLR/BLR differential magnification ratio is at 
minimum 0.3 mag. 



two images. 

If the wavelength of the zth pixel of a spectrum is A,; then the flux in this pixel is 
F{Xi) — /i(Ai)/(Aj) where f(Xi) is the flux before magnification and /i(Ai) is the magnification 
at that wavelength. The total flux from the BLR is = ^iF{Xi). We will assume that one of 
the BLR images is uniformly magnified, i.e. the magnification is not a function of wavelength. 
To be specific wc will label this image B although it could be any image or an average of some 
of the images. For this image (Xi) / (Xg) = /(Ai)//(Ao) where Xo is a reference pixel that 
can be varied. Because the pre-lensed line profile, /(Ai), is the same for all the images we can 
calculate the change to the total flux in image A caused by a frequency dependent magnification: 

- F^(X )F^ 

«(«-^ = |;r{^-i (Al) 

where F^ is what the total flux would be if the magniflcation were uniform and equal to 
/x'^(Ao). Another way to think of this quantity is to note that if you wanted to correct the 
observed differential magnification ratio (with the sign convention used in this paper) to what it 
would be if the BLR magnification ratio where uniformly jj,^ / ii^{Xo) then one would subtract 
2.5 log[i?(Ao) + 1] magnitudes. After subtracting the continuum and masking any sky lines 
equation (Al) can be applied directly to the data to determine if a wavelength dependent 
component of the magnification is causing a significant change in the line flux. Figure 16 shows 
this for our data. 

Microlensing is not the only thing that could affect the line profiles. Within the broad line 
there is also a contribution from the NLR. This NLR contamination will reduce the magnitude 
of the differential magnification ratio, diluting any signal from substructure. It will also 
make images with relatively bright BLRs have a wider line profiles as is observed in our data 
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(Figure 6). If we use a simple model where there is a distinct NLR and BLR in H/3 (which is 
probably not the case) and we denote their magnifications as /xnl and /^bl respectively then the 
expected shape parameter will be 

(lo(Xo) + r^jiuj + r^) 

where u = ft^h/fBh and r = /xbl/a*nl- Note that if the NLR and BLR components of the line 
are both symmetric then this effect will not make the line asymmetric. More generally, since the 
NL is narrow by definition the line profile should only be modified in the central region. 

It is also interesting to note that in the limit that w — s- while the central wavelength is still 
dominated by the NL (w(Ajj^) oo) R{X^^) = DMR — 1. With our data this would give an 
estimate of DMR = 0.3 mag between A and B images - still well above the noise. This is a 
lower limit since as the dominance of the NL at the central wavelength, w(Ajj^), is reduced the 
implied DMR becomes larger. 
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